Modeling Frequent Allophones in Jap

نویسندگان

  • Long Nguyen
  • Xuefeng Guo
چکیده

In this paper, we describe a technique to model frequent allophones in Japanese speech recognition. The Consonant-Vowel syllabic structure (CV) is dominant in Japanese. Based on frequency, the distribution of CV pairs is rather skewed. Isolating out the most frequent allophones through the use of additional phonemes in acoustic modeling can achieve better recognition accuracy. By introducing ten new phonemes for the five most common CV pairs, we achieved a 30% relative reduction in word error rate for spontaneous speech and 6% relative reduction overall for all speech categories in a Japanese broadcast news transcription task.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Allophone-based acoustic modeling for Persian phoneme recognition

Phoneme recognition is one of the fundamental phases of automatic speech recognition. Coarticulation which refers to the integration of sounds, is one of the important obstacles in phoneme recognition. In other words, each phone is influenced and changed by the characteristics of its neighbor phones, and coarticulation is responsible for most of these changes. The idea of modeling the effects o...

متن کامل

Study of the most frequent natural tooth colors in the Spanish population using spectrophotometry

PURPOSE To identify the most frequent natural tooth colors using the Easyshade Compact (Vita -Zahnfabrik) spectrophotometer on a sample of the Spanish population according to the 3D Master System. MATERIALS AND METHODS The middle third of the facial surface of natural maxillary central incisors was measured with an Easyshade Compact spectrophotometer (Vita Zahnfabrik) in 1361 Caucasian Spanis...

متن کامل

Synthesized Fricative ch Specific Features and Influence on Speech Quality Analysis

One of speech synthesis main problems is synthesis of unvoiced fricatives. One of our previously stated conclusions is that consonant x is influenced by before and behind existing phonetic elements. The aim of experiments described in this paper is to evaluate influence of different x allophones for speech intelligibility and automatic speech recognition. In this paper the formal system, which ...

متن کامل

Modeling and recognition of phonetic and prosodic factors for improvements to acoustic speech recognition models

This paper examines the usefulness of including prosodic and phonetic context information in the phoneme model of a speech recognizer. This is done by creating a series of prosodic and phonetic models and then comparing the mutual information between the observations and each possible context variable. Prosodic variables show improvement less often than phone context variables, however, prosodi...

متن کامل

A MetaPhoneme inventory

This paper focuses on the sharing of phonolog-ical information in a multilingual inheritance-based lexicon. It explores the possibility of establishing a phoneme inventory for a group of languages in which language-speciic phonemes function as \allophones" of newly deened meta-phonemes. Danish, Dutch, English, and Ger-man were taken as a test bed and their vowel phoneme inventories were studied...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002